How to Annotate Linguistic Information in FILES and SCAT
نویسنده
چکیده
We present a suite of applications used for the Italian Treebank which share their linguistic processor and end up finally in higher level annotation tool called “FILES”. The first application “FILES” – Fully Integrated Linguistic Environment for Syntactic and Functional Annotation is a prototype for a fully integrated linguistic environment for syntactic functional annotation of corpora. It takes as input tagged and disambiguated tokenized texts, a file containing the same text split into sentences, and a files containing the morphosyntactic and semantic features associated to each tagged token. Tokens may be aither single words, polywords, abbreviations or punctuation marks. An as yet separated module of “FILES” is the syntactic constituency annotation environment which uses a shallow parser on the same tagged files and produces a fully bracketed output where each sentence is a record. Files contaning bracketed sentences are given as input to Syntactic Constituency Annotation Tool “SCAT” for manual verification.
منابع مشابه
A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information
Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating them potentially can play an important role in transmitt...
متن کاملBourdieu and Genette in Paratext: How Sociology Counts in Linguistic Reasoning
While Bourdieu’s theory of practice provides an ensemble of conceptual tools which analyze patterns of social life that are irreducible to the limiting view of individuals as free-acting agents, Genette’s paratextual theory offers the metalanguage necessary to account for the microcosm of paratext as a linguistic space. This study takes issue with unidirectional approaches to researching parate...
متن کاملInter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space
In this paper, we present a case study for measuring inter-annotator agreement on a linguistic ontology for spatial language, namely the spatial extension of the Generalized Upper Model. This linguistic ontology specifies semantic categories, and it is used in dialogue systems for natural language of space in the context of human-computer interaction and spatial assistance systems. Its core rep...
متن کاملA new method for venom extraction from venomous fish, Green Scat
Scatophagus argus argus (Green Scat) is a pretty aquarium fish. Its hard spines are venomous and can cause painful injury. In this study 60 specimens of Green Scat were collected periodically from coastal waters of Boushehr (south of Iran) from May 2011 to April 2012. Anatomical features of venomous spines were investigated. Scat venom was extracted from the spines in a new manner for keeping t...
متن کاملHow To Integrate Linguistic Information In FILES And Generate Feedback For Grammar Errors
We present three applications which share some of their linguistic processor. The first application “FILES” – Fully Integrated Linguistic Environment for Syntactic and Functional Annotation is a fully integrated linguistic environment for syntactic and functional annotation of corpora currently being used for the Italian Treebank. The second application is a shallow parser – the same used in FI...
متن کامل